Improved Protein Secondary Structure Prediction using a Intelligent HSVM Method with a New Encoding Scheme
نویسندگان
چکیده
Prediction of protein secondary structures is an important problem in bioinformatics and has many applications. Successful secondary structure predictions provide a starting point for direct tertiary structure modelling, and also can significantly improve sequence analysis and sequence-structure threading for aiding in structure and function determination. Now many secondary structure prediction methods routinely achieve accuracy (Q3) of about 70%. We believe this accuracy could be further improved by using a hybrid method as essential part of the prediction process. In this article, a hybrid SVM(HSVM) has been used to predict protein secondary structure based on the method of combining physicochemical properties of amino acid residues with position-specific scoring matrices (PSSM) containing evolutionary information. Secondary structure can be predicted at significantly increased accuracy. Using a knowledge discovery theory based on inner cognitive mechanism (KDTICM) method, we have developed a gradually enhanced, multi-layered prediction system to predict protein secondary structure, compound pyramid model (CPM). The results are found to be superior to those produced by other methods with blind test dataset from the CASP9 meeting, including the popular psipred method according to Q3 and SOV99 accuracy. The results show that our method has strong generalization ability. The CPM website is accessible at http://kdd.ustb.edu.cn/protein _Web/.
منابع مشابه
Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملA new security proof for FMNV continuous non-malleable encoding scheme
A non-malleable code is a variant of an encoding scheme which is resilient to tampering attacks. The main idea behind non-malleable coding is that the adversary should not be able to obtain any valuable information about the message. Non-malleable codes are used in tamper-resilient cryptography and protecting memories against tampering attacks. Many different types of non-malleability have alre...
متن کاملA Fuzzy Expert System for Predicting the Performance of Switched Reluctance Motor
In this paper a fuzzy expert system for predicting the performance of a switched reluctance motor has been developed. The design vector consists of design parameters, and output performance variables are efficiency and torque ripple. An accurate analysis program based on Improved Magnetic Equivalent Circuit (IMEC) method has been used to generate the input-output data. These input-output data i...
متن کاملExploring protein fold space by secondary structure prediction using data distribution method on Grid platform
MOTIVATION Since the newly developed Grid platform has been considered as a powerful tool to share resources in the Internet environment, it is of interest to demonstrate an efficient methodology to process massive biological data on the Grid environments at a low cost. This paper presents an efficient and economical method based on a Grid platform to predict secondary structures of all protein...
متن کاملIncorporating Global Information into Secondary Structure Prediction with Hidden Markov Models of Protein Folds
Here we propose an approach to include global structural information in the secondary structure prediction procedure based on hidden Markov models (HMMs) of protein folds. We first identify the correct fold or 'topology' of a protein by means of the HMMs of topology families of proteins. Then the most likely structural model for that protein is used to modify the sequence of secondary structure...
متن کامل